Convex Formulation of Multiple Instance Learning from Positive and Unlabeled Bags
نویسندگان
چکیده
Multiple instance learning (MIL) is a variation of traditional supervised learning problems where data (referred to as bags) are composed of sub-elements (referred to as instances) and only bag labels are available. MIL has a variety of applications such as content-based image retrieval, text categorization, and medical diagnosis. Most of the previous work for MIL assume that training bags are fully labeled. However, it is often difficult to obtain an enough number of labeled bags in practical situations, while many unlabeled bags are available. A learning framework called PU classification (positive and unlabeled classification) can address this problem. In this paper, we propose a convex PU classification method to solve an MIL problem. We experimentally show that the proposed method achieves better performance with significantly lower computation costs than an existing method for PU-MIL.
منابع مشابه
Ellipsoidal Multiple Instance Learning
We propose a large margin method for asymmetric learning with ellipsoids, called eMIL, suited to multiple instance learning (MIL). We derive the distance between ellipsoids and the hyperplane, generalising the standard support vector machine. Negative bags in MIL contain only negative instances, and we treat them akin to uncertain observations in the robust optimisation framework. However, our ...
متن کاملMultiple-Instance Active Learning
We present a framework for active learning in the multiple-instance (MI) setting. In an MI learning problem, instances are naturally organized into bags and it is the bags, instead of individual instances, that are labeled for training. MI learners assume that every instance in a bag labeled negative is actually negative, whereas at least one instance in a bag labeled positive is actually posit...
متن کاملCS395T Data Mining Project report One-class SVM formulations for Multiple Instance learning
Multiple Instance learning (MIL) considers a particular form of weak supervision in which the learner is given a set of positive bags and negative bags. Positive bags are sets of instances containing atleast one positive example and negative bags are sets of instances all of which are negative. A number of binary SVM based solutions have been proposed to this problem like the Normalized Set Ker...
متن کاملMultiple Instance Learning for Computer Aided Diagnosis
Many computer aided diagnosis (CAD) problems can be best modelled as a multiple-instance learning (MIL) problem with unbalanced data: i.e. , the training data typically consists of a few positive bags, and a very large number of negative instances. Existing MIL algorithms are much too computationally expensive for these datasets. We describe CH, a framework for learning a Convex Hull representa...
متن کاملRisk Minimization Framework for Multiple Instance Learning from Positive and Unlabeled Bags
Han Bao The University of Tokyo, 113-0033 Tokyo, Japan [email protected] Tomoya Sakai The University of Tokyo, 277-8561 Chiba, Japan RIKEN Center for AIP, 103-0027 Tokyo, Japan [email protected] Masashi Sugiyama RIKEN Center for AIP, 103-0027 Tokyo, Japan The University of Tokyo, 277-8561 Chiba, Japan [email protected] Issei Sato The University of Tokyo, 277-8561 Chiba, Japa...
متن کامل